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Background of the Invention 

1. Technical Field 



The present invention relates in general to the Internet. More specifically, the 
present invention relates to the field web browsers. 



2. Background Art 



The development of the ED VAC computer system of 1948 is often cited as the 
beginning of the computer era. Since that time, computer systems have evolved into 
extremely sophisticated devices that may be found in many different settings. Computer 
systems typically include a combination of hardware (e.g., semiconductors, circuit boards, 
etc.) and software (e.g., computer programs). As advances in semiconductor processing 
and computer architecture push the performance of the computer hardware higher, more 
sophisticated computer software has evolved to take advantage of the higher performance 
of the hardware, resulting in computer systems today that are much more powerfiil that 
just a few years ago. 



Other changes in technology have also profoundly affected how we use 
computers. For example, this widespread proliferation prompted the development of 
computer networks that allow computers to communicate with each other. With the 
introduction of the personal computer (PC), computing became accessible to large 
numbers of people. Networks for personal computers were developed to allow individual 
users to communicate with each other. In this manner, a large number of people within a 
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company could communicate at the same time with a software appUcation running on one 
computer system. 

One significant computer network that has recently become very popular is the 
Internet. The Internet grew out of the modem proliferation of computers and networks, 
and has evolved into a sophisticated worldwide network of computer systems linked 
together that collectively make up the "world-wide web", or WWW. A user at an 
individual PC or workstation that wishes to access the WWW, typically does so using a 
software application known as a web browser. Web browsers communicate with 
computer systems called web servers. The communication between web browsers and 
web servers can be done according to any of several Internet protocols (e.g., hypertext 
transfer protocol (http); file transfer protocol (ftp); Internet Inter-Orb Protocol (HOP). 

Information transmitted fi-om the web server to the web browser is generally 
formatted using a specialized language called Hypertext Markup Language (HTML). 
HTML is a language that describes the structure of a document in a standard way that 
web browsers are able to understand and interpret. An HTML document stored on a web 
server, commonly called a "web page," can thus be downloaded by any web browser and 
displayed on the client machine. HTML allows web pages to be linked together using 
hypertext. Hypertext is a method of organizing text, graphics and other kinds of data that 
lets individual data elements point to one another. Thus, by using hypertext to link pages 
together web client users can selectively view pages that interest them. 

Many web pages use extensive graphics and other "components" to dress up their 
web page, where components are defined in this patent to be any additional items 
referenced in HTML documents. These components are included in an HTML document 
through the use of specialized tags. For example, the image tag <IMG> is used to insert a 
graphics image that is a component in the HTML document. Likewise, the applet tag 
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<Applet> is used to insert an applet, a small program designed to be downloaded and run 
on the web client, into the HTML document. 

When the web browser requests a page, the HTML document is first downloaded 
to the browser. The web browser then automatically requests the various components 
5 that are referenced in the HTML document. The server then downloads the components 
to the web browser so that the web browser can then display the completed web page to 
the user. This process happens automatically and without the web client user having any 
control over the process. 

In many cases, the extra components are not of interest to a particular web client 

10 user. For example, a component may comprise a graphics image advertisement for a 

\i product that does not interest the user. In another case, the component may comprise an 

W applet which performs a function that the user does not need or want to be performed. A 

j:g component could also comprise a form that the user does not need or wish to have. In all 

! p of these cases, the components comprise items which the user does not desire to 

iO 15 download. 

i = I 

JT Furthermore, these undesirable components can be very large. A typical graphics 

■ F image file could comprise hundreds of kilobytes. Downloading such unwanted graphics 

files may take considerable time to download depending on the users method of access to 

the Internet. During this time the user may be unable to proceed and is forced to wait for 
20 all the components to download. Thus, many users are forced to spend valuable time 

waiting for all components to download to their web browser before they can proceed 

with whatever task they have. 

Without an improved browser, web client users will have to continue to wait for 
the download of each and every unwanted component before being able to proceed with 
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what they want to do. It is easy to see that this situation is undesirable from the 
perspective of the web browser user. 



Summary of the Invention 

According to the present invention, a method and apparatus for selectively 
dovmloading components over the World- Wide- Web is provided. The preferred 
embodiment uses a download selection mechanism to display a download selection list to 
a web browser user. The list allows a web user to decide which components to download 
and which to not download, thereby giving the web user more control over the speed of 
their Internet accesses. The preferred method produces a table or chart of components 
and allows the web user to select which components are downloaded. 



Brief Description of the Drawings 

FIG. 1 is a block diagram view of a web client system in accordance with the 
preferred embodiment; 

FIG. 2 is a block diagram of a web client system and a web server system; 

FIG. 3 is a flow diagram representing a document dovmload method; 

FIG. 4 is a representative HTML document; 
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FIG. 5 is a schematic representation of the HTML document of FIG. 4 as it would 
be rendered by a typical web browser; 

FIG. 6 is a flow diagram of a method in accordance with the preferred 
embodiment; 

FIG. 7 is a schematic representation of the HTML document of FIG. 4 with a 
component download list in accordance with the first embodiment; 

FIG. 8 is a schematic representation of the HTML document of FIG. 4 with a 
component download list in accordance with the second embodiment; and 

FIG. 9 is a schematic representation of the HTML document of FIG. 4 with a 
component download list in accordance with the third embodiment. 
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Description of the Preferred Embodiments 



The present invention relates to the use of web browsers to download documents 
over the World- Wide- Web. For those that are not experts in the field, the Overview 
section below provides general background information on document transfers between 
web servers and web browsers over World- Wide- Web. 

Overview- World- Wide- Web Document Transfers 

Transfers between web servers and web browsers are usually in the form of the 
web browser requesting a web resource, and the web server downloading the requested 
resources to the web browser. Tuming to FIG. 3, an HTML download method 300 is 
illustrated. The download method 300 is consistent with typical web browser - web 
server interactions. 

The first step 302 is to establish a connection between the web browser and the 
web server. Typically, the web client user enters a Uniform Resource Locator (URL) for 
a desired web resource into his web browser. The URL of a resource can be manually 
typed into the computer or entered automatically through the use of a hypertext link on 
another web resource. A URL is a standard address used to identify resources that are 
accessible through the WWW. The URL of a resource describes the protocol needed to 
access it and points to its web server, Intemet location, and home directory. 

The web browser sends a message to the web server specified in the URL 
requesting that a connection be established. Web servers listen for requests that are 
directed to them and can recognize requests as being for one of its resources. The web 
server can either accept or reject the request for the connection, and sends a message to 
the web browser to that effect. Many web browsers display a message such as "Connect: 
contacting host www.server.com" while establishing a connection with the server. 
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When the connection is established, the next step 304 is for the web browser to 
send a request for a particular web resource. This involves sending the URL for the web 
resource to the web server. This is typically done automatically by the web browser after 
the connection is established. The web server receives the request and either delivers the 
requested web resource or sends an error message explaining why it caimot. If the web 
server sends the requested web resource, it will do so in the manner requested by the web 
browser. For example, if the URL included the term "http," the web server will send the 
web resource using the hypertext transfer protocol. 

The next step 306 in delivering the resource to the web browser involves sending 
the HTML document to the web browser. The HTML document includes the text and 
formatting information that determine the documents layout. A HTML document may 
also include references to one or more components. These components can include any 
item referenced in the HTML page to be downloaded and integrated with the page, such 
as graphics images, background images, audio, video and multimedia files, forms, applets 
etc. At this time typical web browsers display icons for graphic images and other 
components that are to be downloaded later. The icons are replaced by the component 
once the relevant files have been received. 

Tuming briefly to FIG. 4, an example HTML document 400 is illustrated. The 
HTML document 400 is a web page advertising "Walts World of Melons." The HTML 
document 400 is illustrative of the kind of features that are typically found in a web page, 
although actual HTML documents are likely to be much more complex. 

The HTML document 400 includes references to five components: an applet, 
three images and a background image, hi particular, an applet (a small program designed 
to be downloaded and run on the web client) called "DancingMelon.class" is referenced 
by the <APPLET> tag. The applet tag specifies the location of the applet on the web 
server, and causes the web server to download the specified applet to the web browser. 
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Likewise, three images are referenced in the document. Two of the images, 
watermelon.gif and Honeydew.gif are Graphics Image Format (GIF) files. The other, 
cantalope.jpg is a Joint Photographic Expert's Group (JPEG) file. The images are 
referenced by the <IMG> tag, and include the URL for the image file, location 
information and alternative reference title for the image. An additional image, 
background.jpg is referenced for use as a background component in the web page. As 
stated before, there are many more components which can be referenced in an HTML 
document, and these four are simply illustrative of the several types commonly found. 

Retuming to FIG. 3, in the next step 308, the web browser automatically requests 
the components referenced in the HTML document firom the web server. This is done by 
sending the URL for the component, which was contained in the components tag, to the 
web server. The web server then delivers the requested components to the web browser. 

In the next step, the web browser renders the components and displays the HTML 
page with the embedded components to the user. Turning briefly to FIG. 5, the display 
screen 500 illustrates how the HTML document 400 is displayed by a typical web 
browser. The display screen includes HTML text and the components referenced in 
HTML document 400. In particular, the DancingMelon applet 502, the watermelon GIF 
image 504, the cantaloupe JPEG image 506 and the honeydew GIF image 508 would all 
be downloaded and displayed by the web browser. Additionally, the background GIF 
component, not illustrated in FIG. 5, would be downloaded and displayed by the browser. 
The web browser thus automatically downloads all components referenced in the HTML 
document. 

Detailed Description 

Referring now to FIG. 2. a client system 200 and a server system 100 are shown to 
illustrate a preferred embodiment of the present invention. The client system 200 is 
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connected to the server system 100 over network connection 106. The network 
connection 106 can be any network mechanism, such as a local area network (LAN), a 
Mdde-area-network (WAN), or intranet connection, but is preferably an Internet 
connection allov^ng the client system 200 to communicate with the server system 100 
over the World- Wide- Web. 

The server system 100 includes a web server application 102. The web server 
application 102 communicates with web browser 224 on the client system 200. The web 
server application 102 delivers files to the web browser 224 over connection 106 when 
requested by the web browser 224. The web server application 102 can comprise any 
suitable web server application such as IBM Internet Connection Server, Netscape Web 
Server, etc. 

The client system 200 can be any suitable client system, such as an IBM 
compatible personal computer. The client system 200 includes a web browser application 
224. In accordance with the preferred embodiment, the web browser application 224 
includes an HTML component download selection mechanism 226. The HTML 
component download selection mechanism 226 allows a user of web browser 224 to 
select which HTML components are downloaded from the web server application 102 by 
supplying a download selection list to the web user. 

Turning now to FIG. 1, a block diagram of client computer system 200 in 
accordance with the present invention is illustrated. The computer system 200 can be any 
suitable system, such as an IBM compatible personal computer. However, those skilled 
in the art will appreciate that the mechanisms and apparatus of the present invention 
apply equally to any computer system, regardless of whether the computer system is a 
complicated multi-user computing apparatus or a single user personal computer. As 
shown in the block diagram of FIG. 1, computer system 200 comprises main or central 
processing unit (CPU) 202 connected to main memory 204, auxiliary storage interface 
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206, terminal interface 208, and network interface 210. These system components are 
interconnected through the use of a system bus 160. Auxiliary storage interface 206 is 
used to connect mass storage devices (such as DASD devices) to computer system 200. 

Main memory 204 contains an operating system 222 and a v^eb browser 224. In 
accordance with the preferred embodiment, the web browser 224 will include an HTML 
component download selection mechanism 226. HTML component download selection 
mechanism 226 to allows a user of the web browser 224 to select which, if any, of the 
HTML components embedded in a web page are to be downloaded. Computer system 
200 preferably utilizes well known virtual addressing mechanisms that allow the 
programs of computer system 200 to behave as if they only have access to a large, single 
storage entity instead of access to muhiple, smaller storage entities such as main memory 
204 and DASD devices. Therefore, while operating system 222 and web browser 224 are 
shown to reside in main memory 204, those skilled in the art will recognize that these 
programs are not necessarily all completely contained in main memory 204 at the same 
time. (It should also be noted that the term "computer system memory" is used herein to 
generically refer to the entire virtual memory of computer system 200.) 

Operating system 222 can be any operating system, such as OS/2, Windows, AIX, 
OS/400 etc, and those skilled in the art will appreciate that the spirit and scope of the 
present invention is not limited to any one operating system. 

Although computer system 200 is shown to contain only a single main CPU and a 
single system bus, those skilled in the art will appreciate that the present invention may 
be practiced using a computer system that has multiple CPUs and/or multiple buses. 

Terminal interface 208 is used to directly connect one or more terminals to 
computer system 200. These terminals may be non-intelligent or fully programmable 
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workstations, are used to allow system administrators and users to communicate with 
computer system 200. 

Network interface 210 is used to connect other computer systems and/or 
workstations to computer system 200 in networked fashion. In the preferred embodiment 
the network interface 210 includes a connection to the Internet and the World- Wide- Web, 
but could also be to connect to other networked environments, such as internal web-based 
systems (typically called intranets). The present invention applies equally no matter how 
computer system 200 may be connected to other computer systems and/or workstations, 
regardless of whether the connection is made using present-day analog and/or digital 
techniques or via some networking mechanism of the future. 

It is important to note that while the present invention has been (and will continue 
to be) described in the context of a fully functional computer system, those skilled in the 
art will appreciate that the mechanisms of the present invention are capable of being 
distributed as a program product in a variety of forms, and that the present invention 
applies equally regardless of a particular type of signal bearing media used to actually 
carry out the distribution. Examples of signal bearing media include: recordable type 
media such as floppy disks, CD-ROMs and transmission type media such as digital and 
analog communication links. 

Turning to FIG. 6, an HTML download method 900 in accordance with the 
preferred embodiment is illustrated. The download method 900 gives the web client user 
the choice of which HTML components to download, and thus allows the user to save 
time by not downloading components which are not of interest. 

The first step 902 is to establish a connection between the web browser 224 and 
the web server 102. Typically, the web client user enters a Uniform Resource Locator 
(URL) for a desired web resource into his web browser. The web browser then sends a 
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message to the specified web server specified in the URL requesting that a connection be 
established. The web server can either accept or reject the request for the connection, and 
sends a message to the web browser to that effect. 

When the connection is estabUshed, the next step 904 is for the web browser to 
send a request for a particular web resource. This involves sending the URL for the web 
resource to the web server, which is typically done automatically by the web browser after 
the connection is established. The web server receives the request and either delivers the 
requested web resource or sends an error message explaining why it cannot. 

The next step 906 in delivering the resource to the web browser involves sending 
the HTML document to the web browser. The HTML document includes the text and 
formatting information that determines the documents layout. A HTML document may 
also include references to one or more components. These components can include any 
item referenced in the HTML page to be dovmloaded and integrated with the page, such 
as graphics images, background images, audio, video and multimedia files, forms, applets 
etc. 

In accordance with the preferred embodiment, the component download selection 
mechanism 226 operates to give the web client user the choice as to which components to 
download. The web browser parses the references to the components from the HTML 
document and, in step 908, the web browser requests input from the web user as to which, 
if any, of the referenced components the web user would like to download. In the 
preferred embodiment, the request to the web user occurs dynamically, e.g., automatically 
after the HTML document is downloaded, hi the next step 910 the web browser requests 
from the web server all components selected by the web user. The web server then 
delivers the requested components or sends an error message indicating it could not fiilfill 
the request. 



IBM Docket R0996-141 



12 



There are several embodiments that can be used to implement steps 908 and 910 
of method 900. For example, turning to FIG. 7, the display screen 700 illustrates how 
the HTML document 400 of FIG. 4 would be displayed by a web browser in accordance 
with the first embodiment. In this embodiment after downloading the HTML document 
400 the web browser displays two panes of information. In the first pane 702, the web 
browser displays the text portions of the HTML document 400. In the second pane 704, 
the component download selection mechanism causes the web browser to display a 
component download list to allow users to select which components to download. The 
component download list illustrated in FIG. 7 includes the file name, the length of the file 
and the number of bytes to transfer, and the percentage of bytes that have been 
transferred. The list is preferably ordered to reflect the order of downloading. In the 
illustrated embodiment, the component download selection mechanism allows users to 
select which components are to be downloaded and does not download any components 
until selected by the user. In particular, the component download list includes a select 
button for each component which allows a user to select which components to download. 

Of course, this is just one way the component download list could be 
implemented. For example, the component download list could include other information 
about the components, such as the title or type. Specifying files by type in the component 
download list would allow a user to quickly identify certain file types (such as image 
files) that typically take much longer to download than other types of components. 
Likewise, the component download list could use color or other means to distinguish 
components that are being downloaded, those that have been dovmloaded, and those that 
have not been selected to be downloaded. 

In a variation on this embodiment, the component download selection mechanism 
begins to automatically download components while allowing users to stop a download 
by selecting an appropriate item on the component download list. 
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Turning now to FIG. 8, a dialog box 800 illustrates how the HTML document 400 
would be displayed by a web browser in accordance with a second embodiment. A dialog 
box is a type of pop-up window used to request input from the user. In this embodiment 
after downloading the HTML document 400 the web browser opens a component 
5 download list in a separate dialog box which is displayed to the user. Again, the 

component download list in dialog box 700 allows users to select which components to 
download. The component download list illustrated in FIG. 8 includes the component 
file name, the title of the component, the component type, the length of the file and the 
number of bytes transfer and the percentage of bytes that have been transferred. In this 
10 illustrated embodiment, the component download selection mechanism allows users to 

select which components are to be downloaded by, for example, selecting the component 
file with an input device such as a mouse, and does not dovmload any components until 
selected by the user. 

This is just one way in which a component download list could be implemented 
15 in a dialog box. For example, the component dovraload list could include other 

information about the components, such as the title or type. Likewise, the component 
download list could use color or other means to distinguish components that are being 
dovmloaded, those that have been downloaded, and those that have not been selected to 
be dovraloaded. 

20 Turning to FIG. 9, the display screen 900 illustrates how the HTML document 

400 would displayed by a web browser in accordance v^th the third embodiment. In this 
embodiment after downloading the HTML document 400 the web browser displays the 
document along with component download information for the various HTML 
components along with a selection button for each component. The component download 

25 information illustrated in FIG. 9 includes the file name, the number of bytes and the 

percentage of bytes that have been transferred. In the illustrated embodiment, the 
component download selection mechanism allows users to select which components are 
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not to be downloaded by hitting the its associated stop button. In variation on this 
embodiment, the stop buttons are replaced by "get" buttons that, when selected, cause the 
browser to request the associated component. 

Thus, the preferred method and apparatus provide a mechanism for selectively 
downloading components over the World- Wide- Web. The preferred embodiment allows 
a web user to decide which components to download and which to not, thereby giving the 
web user more control over the speed of their Internet accesses. The preferred method 
produces a table or chart of components and allows the web user to select which 
components are downloaded. 
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